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o 

Cn Abstract. Previous work by Demaine et al. (2012) developed a strong 

^.^ connection between smallest context-free grammars and staged self-assembly 

d systems for one-dimensional strings and assemblies. We extend this work 

^TJ to two-dimensional polyominoes and assemblies, comparing staged self- 

^^ assembly systems to a natural generalization of context-free grammars 

T— ( we call polyomino context-free grammars (PCFGs). 

We achieve nearly optimal bounds on the largest ratios of the smallest 

U' ' PCFG and staged self-assembly system for a given polyomino with n 

cells. For the ratio of PCFGs over assembly systems, we show that the 
\^ smallest PCFG can be an J?(n/(logn)'^)-factor larger than the smallest 

f/2 staged assembly system, even when restricted to square polyominoes. For 

O the ratio of assembly systems over PCFGs, we show that the smallest 

staged assembly system is never more than a 0(logn)-factor larger than 
-vj the smallest PCFG and is sometimes an i?(logn/loglogn)-factor larger. 

> 

oo 

en 1 Introduction 

o 

In the niid-1990s, the Ph.D. thesis of Erik Winfree [57] introduced a theoretical 

^i model of self-assembling nanoparticles. In this model, which he called the abstract 

-^ tile assembly model (aTAM), square particles called tiles attach edgewise to each 

^-H other if their edges share a common glue and the bond strength is sufficient 

ILJ to overcome the kinetic energy or temperature of the system. The products of 

these systems are assemblies: aggregates of tiles forming via crystal- like growth 
starting at a seed tile. Surprisingly, these tile systems have been shown to be 
computationally universal |27l5j , self-simulating |12ll3j , and capable of optimally 
encoding arbitrary shapes |23lll26j . 

In parallel with work on the aTAM, a number of variations on the model 
have been proposed and investigated. These variations change a number of fea- 
tures of the original aTAM, for instance modifying tile shape J17I14I8| . allowing 
glues to repulse [11 22 24 , or adding labels to each tile to produce patterned 
assemblies |19)20(15p6f25] . For a more thorough treament of the aTAM and its 
variants, see the recent surveys of Patitz [21] and Doty [TO] . 

One well-studied variant called the hierarchical or two-handed assembly model 
(2HAM) eliminates the seed tile and allows tiles and assemblies to attach in 
arbitrary order. This model was shown to be capable of (theoretically) faster 
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assembly of squares [4] and simulation of aTAM systems [2, , including captur- 
ing the seed-originated growth dynamics. A generalization of the 2HAM model 
proposed by Demaine et al. [7] is the staged assembly model, which allows the 
assemblies produced by one system to be used as reagents (in place of tiles) 
for another system, yielding systems divided into distinct assembly stages. They 
showed that this staging can replace the role of glues in encoding complex assem- 
blies, allowing the construction of arbitrary shapes efficiently while only using a 
constant number of glue types, a result impossible in the aTAM or 2HAM. 

To understand the power of the staged assembly model, Demaine et al. [S] 
studied the problem of finding the smallest system producing a one-dimensional 
assembly with a given sequence of labels on its tiles, called a label string. They 
proved that for systems with a constant number of glue types, this problem is 
equivalent to the well-studied problem of finding the smallest context-free gram- 
mar whose language is the given label string, also called the sm,allest gram,m,ar 
problem, (see 18'3]). For systems with unlimited glue types, they proved that the 
ratio of the smallest context-free grammar over the smallest system producing 
an assembly with a given label string of length n (which they call separation) is 
Q{^Jn/\ogn) and 0{{n/\ogn)^/^) in the worst case. 

In this paper we consider the two-dimensional version of this problem: finding 
the smallest staged assembly system producing an assembly with a given label 
polyomino. For systems with constant glue types and no cooperative bonding, 
we achieve separation of grammars over these systems of J7(n/(loglogn)^) for 



polyominoes with n cells (Sect. 6.1), and i7(n/(logn)'') when restricted to rect 



angular (Sect. 6.2 1 or square (Sect. 6.3) polyominoes with a constant number of 



labels. Adding the restriction that each step of the assembly process produces 
a single product, we achieve i7(n/(logn)^) separation for general polyominoes 
with a single label (Sect. [6J] ). For the separation of staged assembly systems over 
grammars, we achieve bounds of ^(logn/loglogn) (Sect.l4| and, constructively, 
O(logn) (Sect. Is]). For all of these results, we use a simple definition of context- 
free grammars on polyominoes that generalizes the deterministic context-free 
grammars (called RCFGs) of |9]. 

When taken together, these results give nearly complete picture of how small- 
est context-free grammars and staged assembly systems compare. For some poly- 
ominoes, staged assembly systems are exponentially smaller than context-free 
grammars (O(logn) vs. J7(n/(logn)^)). On the other hand, given a polyomino 
and grammar deriving it, one can construct a staged assembly system that is a 
(nearly optimal) 0(logn)-factor larger and produces an assembly with a label 
polyomino replicating the polyomino. 



2 Staged Self-Assembly 

An instance of the staged tile assembly model is called a staged assembly system 
or system, abbreviated SAS. A SAS S = {T, G, r, M, B) is specified by five parts: 
a tile set T of unit square tiles, a glue function G : E{GY -^ {0, 1, . . . ,r}, a 
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temperature t G N, and a directed acyclic mix graph M = {V^E), and a start 
bin function B : Vl ^ T from the leaf vertices V^ C V oi M. 

Each tile i G T is specified by a 5-tuple {I, g^ ge^ Qs, gw) consisting of a label 
I taken from an alphabet S{T) (denoted l{t)) and a set of four non-negative 
integers in S{G) = {0, 1, . . . , A;} specifying the glues on the sides of t with 
normal vectors (0,1) (north), (1,0) (east), (0,-1,) (south), and (—1,0) (west), 
respectively, denoted gu(t). In this work we only consider glue functions with 
the constraints that G{gi,gj) — G{gj,gi), if G{gi,gj) > then gi — gj, and 
^(0,5^) = for any gj. 

A configuration is a a partial function C : iP' ^ T mapping locations on 
the integer lattice to tiles. Any two locations pi ~ {xi,yi), p2 — (2^2,2/2) in 
the domain of C (denoted dom(C)) are adjacent if ||p2 — Pi|| — 1 and the 
bond strength between any pair of tiles C(pi) and C{p2) at adjacent locations 
is G(.gp2_pj^(C(pi)), 5pj_p2(C(p2))- A configuration is a r-stable assembly or an 
assembly at temperature t if dom(C) is connected on the lattice and, for any 
partition of dom(C) into two subconfigurations Ci, C2, the sum of the bond 
strengths between tiles at pairs of locations pi E dom(Ci), p2 G dom(C2) is at 
least T. Any pair of configurations Ci, C2 are equivalent if there exists a vector 
V = {x, y) such that dom(Ci) — {p + v \p € dom(C2)} and Gi{p) — C2{p + v) 
for all p e dom(Ci). Two r-stable assemblies Ai, A2 are said to assemble into 
a superassembly A3 if there exists a translation vector v ~ (x, y) such that 
dom(v4i) C\\j)^v\peA2\ = % and A3 defined by the partial functions A\ and 
A!^ with A!2(j)) = A2{p + v) is a r-stable assembly. 

Each vertex of the mix graph M describes a two-handed assembly process. 
This process starts with a set of r-stable input assemblies I. The set of assembled 
assemblies Q is defined recursively as / C Q, and for any pair of assemblies 
Ai,A2 G Q with superassembly A3, A3 G Q. Finally, the set oi products P QQ 
is the set of assemblies A such that for any assembly A' , no superassembly of A 
and A' is in Q. 

The mix graph M = {V, E) of S defines a set of two-handed assembly pro- 
cesses (called mixings) for the non-leaf vertices of M (called bins). The input 
assemblies of the mixing at vertex v is the union of all products of mixings at 
vertices v' with (u',f) G E. The start bin function B defines the lone single-tile 
product of each mixings at a leaf bin. The system S is said to produce an as- 
sembly A if some mixing of S has a single product, A. We define the size of 5, 
denoted S, to be \E\, the number of edges in M . If every mixing in a 5 has a 
single product, then 5 is a singular self-assembly system (SSAS). 



The results of Section 6.4 use the notion of a self-assembly system S' simu- 
lating a system S by carrying out the same sequence of mixings and producing a 
set of scaled assemblies. Formally, we say a system S' — {T' , G' , r, M') simulates 
a system S = {T,G,t, M) at scale b if there exist two functions /, g with the 
following properties: 

(1) The function / : {S{T') U 0)''' -^ E(T) U maps the labels of 6 x 6 regions 
of tiles (called blocks) to a label of a tile in T. The empty label denotes 
no tile. 
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Fig. 1. A self-assembly system (SAS) consisting of a mix graph and tile types (left), and 
the assemblies produced by carrying out the algorithmic process of staged self-assembly 
(right). 



(2) The function g : S' ^ V maps a subset S' of the vertices of the mix graph 
M' to vertices of the mix graph M such that g is an isomorphism between 
the subgraph induced by 5" in M' and the graph M. 

(3) Let P{v) be the set of products of the bin corresponding to vertex i; in a 
mix graph. Then for each vertex v G M with v' — g^^{v), P{v) = {f{p) \ 

pePiv')}. 

Intuitively, / defines a correspondence between the 6-scaled macrotiles in S' 
simulating tiles in S, and g defines a correspondence between bins in the systems. 
Property (3) requires that / and g do, in fact, define correspondence between 
what the systems produce. 

The self-assembly systems constructed in Sections [5] and [6] produce only 
mismatch-free assemhlies: assemblies in which every pair of incident sides of two 
tiles in the assembly have the same glue. A system is defined to be m,ism,atch-free 
if every product of the system is mismatch- free. 

In Section [5] we construct systems that produce assemblies whose label poly- 
ominoes are scaled versions of other polyominoes, with some amount of "fuzz" 
in each scaled cell. A polyomino P' = (S", L') is said to be a (c, d)-fuzzy replica 
of a polyomino P = {S,L) if there exists a vector {xt,yt) with the following 
properties: 



For each block of cells S',^ > = {{x, y) \ Xt + di < x < Xt + d{i + 1), yt + dj < 
y < yt+d{j + l)} (called a supercell), S',^ jjHS" ^ if and only if (i,j) C S. 
For each supercell S',^ ■^ containing a cell of P' , the subset of label cells 
{ix,y) I xt+di+{d-c)/2 <x < xt + d{i + l) + {d-c)/2,yt + dj + {d-c)/2 < 
y < yt+ d{j + 1) + {d— c)/2} consists of c^ cells of P', with all cells having 
identical label, called the label of the supercell and denoted J~-{i,j)- 
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3. For each supercell S',- -, , any cell that is not a label cell of S',- ■■. has a common 
fuzz label in L' . 

4. For each supercell S',^ -,, the label of the supercell C',- ■•. = P{i,j). 

Properties (1) and (2) define how sets of cells in P' replicate individual cells in 
P, and the labels of these sets of cells and individual cells. Property (3) restricts 
the region of each supercell not in the label region to contain only cells with a 
common fuzz label. Property (4) requires that each supercell's label matches the 
label of the corresponding cell in P. 

3 Polyomino Context-Free Grammars 

Here we describe polyominoes, a generalization of strings, and polyomino context- 
free grammars, a generalization of deterministic context-free grammars. These 
objects replace the strings and restricted context-free grammars (RCFGs) of 
Demaine et al. [2j. 

A labeled polyomino or polyomino P — {S, L) is defined by an edge-connected 
set of points S on the square lattice (called cells) containing (0, 0) and a label 
function L : S ^ ^{P) mapping each cell of P to a label contained in an 
alphabet S{P)- The size of P is the number of cells P contains and is denoted 
|P|. The label of the cell at lattice point (x, y) is denoted L{(x, y)) and we define 
P{x,y) = L((x,y)) for notational convenience. In this section we refer to the 
label or color of a cell interchangeably. 

Define a polyomino context-free grammar (PCFG) to be a quadruple G = 
(Z", P, S", A). The set 17 is a set of terminal symbols and the set _r is a set of non- 
terminal symbols. The symbol S' S P is a special start symbol. Finally, the set A 
consists of production rules, each of the form N — > (i?i, (a;i, j/i)) . . . {Rj, {xj, yj)) 
where N ^ F and is the left-hand side symbol of this unique rule, Ri G N UT, 
and each [xi,yi) is a pair of integers. The size of G is defined to be the total 
number of production rules on the right-hand sides of the rules of A. 

A polyomino P can be derived by starting with S*, the start symbol of G, 
and repeatedly replacing a non-terminal symbol with a set of non-terminal and 
terminal symbols. The set of valid replacements is Z\, the production rules of G, 
where a non-terminal symbol N with lower-leftmost cell at (a:, y) can be replaced 
with a set of symbols Ri at (x-\-xi,y-\-yi), R2 at {x-\-X2, y + 2/2), . . . , i?^ at {x-\- 
Xj,y + yj) if there exists a rule iV ^- {Ri,{xi,yi)){R2,{x2,y2)) ■ ■ ■iRj,lxj,yj)). 
Additionally, the set of terminal symbol cells derivable starting with S must be 
connected and pairwise disjoint. 

The polyomino P derived by the start symbol of a grammar G is called the 
language of G, denoted L{G), and G is said to derive P. In the remainder of 
the paper we assume that each production rule has at most two right-hand side 
symbols (equivalent to binary normal form for ID CFGs), as any PCFG can be 
converted to this form with only a factor-2 increase in size. 

Intuitively, a polyomino context-free grammar is a recursive decomposition 
of a polyomino into smaller polyominoes. Because each non-terminal symbol is 
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the left-hand side symbol of at most one rule, each non-terminal corresponds 
to a subpolyomino of the derived polyomino. Then each production rule is a 
decomposition of a subpolyomino into smaller subpolyominoes (see Figure pi) . 
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Fig. 2. Each production rule in a PCFG generating a single shape is a decomposition of 
the left-hand side non-terminal symbol's polyomino into the right-hand side symbols' 
polyominoes. 



In this interpretation, the smallest grammar deriving a given polyomino is 
equivalent to a decomposition using the fewest distinct subpolyominoes in the 
decomposition. As with in one dimension, finding the smallest PCFG for a given 
polyomino is NP-hard, even to approximate [3], and achieving optimal approx- 
imation algorithms remains open f l6j . 



4 SAS over PCFG Separation Lower Bound 

This result uses a set of shapes we call n-stagglers, an example is seen in FigurelSJ 
The shapes consist of log n bars of dimensions n/ log n x 1 stacked vertically 
atop each other, with each bar horizontally offset from the bar above it by some 
amount in the range — (n/logn — 1), . . . ,n/logn — 1. We use the shorthand 
that \ogn — [lognj for conciseness. Every sequence of \ogn — 1 integers, each 
in the range [— (n/logn — l),n/logn — 1], encodes a unique staggler and by 
the pidgeonhole principle, some n-staggler requires log((2n/logn — l)'°g"~i = 
f2(log^ n) bits to specify. 



n/logn = 2^/8 



log 2^ == 8 



Fig. 3. The 2**-staggler specified by the sequence -18, 13, 9, -17, -4, 12, -10. 



Lemma 1. Any n-staggler can he derived by a PCFG of size 0(log7i). 
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Proof. A set 0(log n) production rules deriving a bar (of size 0{n/ log n) x 1) can 
be constructed by repeatedly doubling the length of the bar, using an additional 
\ogn rules to form the bar's exact length. Creating the stack of bars at their 
relative offsets can be described using a single rule with log n r.h.s. symbols. So 
an rt-staggler can be derived by a PCFG of total size 0(log7i). 

Lemma 2. For every n, there exists an n-staggler P such that any SAS or SSAS 
producing an assembly with label polyomino P has size size ^(log n/ log log n). 

Proof. The proof is information-theoretic. Recall that more than half of all n- 
stagglers require i7((log„)^) bits to specify. Now consider the number of bits 
contained in a SAS S. Recall that |5| is the number of edges in the mix graph 
of S. Any SAS can be encoded naively using 0(|5|log|5|) bits to specify the 
mix graph, 0(|r|log|T|) bits to specify the tile set, and 0(|5|log|r|) bits to 
specify the tile type at each leaf node of the mix graph. Because the number 
of tile types cannot exceed the size of the mix graph, |r| < \S\. So the total 
number of bits needed to specify S (and thus the number of bits of information 
contained in S) is 0(|5|log|5| + |T| log |r| + |5|log|5|) = 0(|5| log|5|). So 
some n-staggler requires a SAS S such that 0{\S\ log \S\) = ^(log n) and thus 
\S\ = J7(log^n/loglogn). 

Theorem 1. The separation of SASs and SSASs over PCFGs is /2(log n/ log log n) . 

Proof. By the previous two lemmas, more than half of all n-stagglers require 
SASs and SSASs of size /2(log^n/loglogn) and all n-stagglers have PCFGs of 
size O(logn). So the separation is i7(logn/loglogn). 

We also note that scaling the n-staggler by a c-factor produces a shape which 
is derivable by a CFG of size 0(logn -I- logc). That is, the result still holds for 
n-stagglers scaled by any amount polynomial in n. For instance, the 0(n)-factor 
of the construction of Theorem [2] 

At first it may not be clear how PCFGs achieve smaller encodings. After all, 
each rule in a PCFG G or mixing in SAS S specifies either a set of right-hand 
side symbols or set of input bins to use and so has up to 0(log IG]) or 0(log \S\) 
bits of information. The key is the coordinate describing the location of each 
right-hand side symbol. These offsets have up to O(logn) bits of information 
and in the case that G is small, say O(logri), each rule has a number of bits 
linear in the size of the PCFG! 



5 SAS over PCFG Separation Upper Bound 

Next we show that the separation lower bound of the last section is nearly large 
as possible by giving an algorithm for converting any PCFG G into a SSAS S 
with system size 0(|G| logn) such that S produces an assembly that is a fuzzy 
replica of the polyomino derived by G. Before describing the full construction, 
we present approaches for efficiently constructing general binary counters and of 
simulating glues using geometry. 
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Increment OOllf, by 1, yielding 0100b. 

Fig. 4. A binary counter row constructed using single-bit constant-sized assemblies. 
Dark blue and green glues indicate 1-valued carry bits, light blue and green glues 
indicate 0- valued carry bits. 



The binary counter row assemblies used here are a generalization of those by 
Demaine et al. [7| consisting of constant-sized bit assemblies, and an example is 
seen in Figure [4] Our construction achieves 0{\ogn) construction of arbitrary 
ranges of rows and increment values, in contrast to the contruction of (7) that 
only produces row sets of the form 0, 1, . . . , 2^ — 1 that increment by 1. To do 
so, we show how to construct two special cases from which the generalization 
follows easily. 

Lemma 3. Let i, j, n be integers such that < i < j < ri. There exists a SSAS 
of size O(logn) with a set of bins that, when mixed, assemble a set of j — i + 1 
binary counter rows, each incrementing by 1. 

Proof. Representing integers as binary strings, consider the prefix tree induced 
by the binary string representations of the integers i through j, which we denote 
T(ijy The prefix tree r(o,2™-i) is a complete tree of height m, and the prefix 



tree Tu ,) with < i < j < 2™ - 1 is a subtree of T, 



(0,2--l) 



with j — i -\- 1 leaf 
nodes See Figure [5] for an example with 

Now let n = 2™ — 1. If T(-o^„) is drawn with leaves in left-to-right order by 
increasing integer values, then the leaves of the subtree ^(i j) appear contiguously. 
So the subtree T^ij) has at most 21ogn internal nodes with one child forming 
the leftmost and rightmost paths in T^ij)- Furthermore, if one removes these 
two paths from Tuj), the remainder of Tiij) is a forest of complete trees with 
at most two trees of each height and 2 log n trees total. 

Note that a complete subtree of the prefix tree corresponds to a set of all 
possible 2^ suffixes of length h, where h is the height of the subtree. The leaves 
of such a subtree then correspond to the set of strings of length I with a specific 
prefix of length I — h and any suffix of length h. For the assemblies we use the 
same geometry-based encoding of each bit as [7| , and a distinct set of glues used 
for each bit of the assembly encoding both the bit index and carry bit value from 
the previous bit. 

Left and right bins. We build a mix graph (seen in Figure |6| consisting 
of two disjoint paths of bins (called left bins and right bins) that are used to 
iteratively assemble partial counter rows i and j by the addition of distinct 
constant-sized assemblies for each bit. The partial rows are used to produce the 
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0000 0001 0010 0011 0100 0101 0100 0101 1000 1001 1010 1011 1100 1101 1110 1111 



Fig. 5. The prefix tree 7(0,15) for integers to 2'' — 1 represented in binary. Tlie bold 
subtree is tlie prefix subtree Tf^5,i4) for integers 5 to 14. 



assemblies in the subtree and missing bit bins (described next). In the suffix 
trees, the bit strings of these assemblies are progressively longer subpaths of the 
leftmost and rightmost paths in the subtree of binary strings of the integers i to 
J- 

Subtree bins. Assemblies in subtree bins correspond to assemblies encoding 
prefixes of binary counter row values. However, unlike left and right bins that 
encode prefixes of only a single value, subtree bins encode prefixes of many 
binary counter values between i the j - namely a set of values forming a maximal 
complete subtree of the subtree of binary strings of integers from i to j, hence 
the name subtree bins. For example, if « = 12 and j = 16, then the set of binary 
strings for values 12 (OllOOf,) to 15 (01111b) have a common prefix 011^. In this 
case a subtree bin containing an assembly encoding the three bits Oil would 
be created. Since there are at most 21ogn such complete subtrees, the number 
of subtree bins is at most this many. Creating each bin only requires a single 
mixing step of combining an assembly from a left or right bin with a single bit 
assembly, for example adding a 1-bit assembly to the left bin assembly encoding 
the prefix Olf,. 

Missing bit bins. To add the bits not encoded by the assembhes in the 
subtree bins, we create sets of four constant-sized assemblies in individual miss- 
ing bit bins. Since the assemblies in subtree bins encode bit string prefixes of 
sets of values forming complete subtrees, completing these prefixes with any 
suffix forms a bit string whose value is between i and j. This allows complete 
non-determinism in the bit assemblies that attach to complete the counter row, 
provided they properly handle carry bits. For every bit index missing in some 
subtree bin assembly, the four assemblies encoding the four possibilities for the 
input and carry values are assembled and placed into separate bins. When all 
bins are mixed, subtree assemblies mix non-deterministically with all possible as- 
semblies from missing bit bins, producing all counter rows whose binary strings 
are found in the subtree. In total, up to 41ogn missing bit bins are created, 
and each contains a constant-sized assembly and so requires constant work to 
produce. 
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Fig. 6. The mix graph constructed for the prefix subtree 7(5,14) seen in Figure [51 



The total number of total bins is clearly O(logn). Consider mixing the left 
and right bins containing completed counter rows for i and j, all subtree bins, and 
all missing bit bins. Any assembly produced by the system must be a complete 
binary counter row, as all assemblies are either already complete rows (left and 
right bins) or are partial assemblies (subtree bins and missing bit bins) that can 
be extended towards the end of the bit string by missing bit bin assemblies, or 
towards the start of the bit string by missing bit and then subtree bin assemblies. 

The second counter generalization is incrementing by non-unitary values: 

Lemma 4. Let k, n be integers such that < k < n and n = 2™. There exists a 
SSAS of size O(logn) with a set of bins that, when mixed, assemble a set o/2™ 
binary counter rows with values 0, 1, ... , 2™ — 1 incremented by k. 

Proof. For each row, the incremented value of the 6th bit of the row depends on 
three values: the previous value of the &th bit, the carry bit from the (6 — l)st 
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addition, and the 6th bit of k. The resuhing output is a pair of bits: the resulting 
value of the 6th bit and the 6th carry bit (seen in Table IT]). 
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Table 1. All bit combinations for a binary adder incrementing n by k. 



Create a set of four 0(l)-tile subassemblies for each bit of the counter, se- 
lecting from the first or second half of the combinations in Table [T] resulting in 
41ogn assemblies total. Each subassembly handles a distinct combination of the 
6th bit value of the previous row, (6 — l)st carry bit, and 6th bit value of k by 
encoding each possibility as a distinct glue. When mixed in a single bin, these 
subassemblies combine in all possible combinations and producing all counter 
rows from to 2™ — 1. 

Lemma 5. Let i,j,k,n be integers such that < i < j < n and < k < n. 
There exists a SSAS of size O(logn) with a set of bins that, when mixed, assemble 
a set of j — i + 1 binary counter rows with values i,i + l, . . . ,j incremented by k. 

Proof. Combine the constructions used in the proofs of Lemmas [3] and |4] by using 
mixing sequences as in the proof of Lemma |3] and sets of four subassemblies 
encoding input, carry, and increment bit values as in the proof of Lemma [4] 

Theorem 8 of Demaine et al. [7] describes how to reduce the number of glues 
used in a system by replacing each tile with a large macrotile assembly, and 
encoding the tile's glues via unique geometry on the macrotile's sides. We prove 
a similar result for labeled tiles, used for proving Theorems [2J [3J and [7] 

Lemma 6. Any mismatch-free t — \ SAS (or SSAS) S — (T, G, r, M) can 
be simulated by a SAS (or SSAS) S' at t = 1 with 0(1) glues, system size 
0{S{T)\T\ + \S\), and 0(log \G\) scale. 

Proof. The proof is constructive. Produce a set of north macroglue assemblies 
for the glue set: 0(log \G\) x 0(1) assemblies, each encoding the integer label of 
a glue i via a sequence of bumps and dents along the north side of the assembly 
representing the binary sequence of bits for i, as seen in Figure [71 All north 
macroglue assemblies share a pair of common glues: an inner glue on the west 
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end of the south side of the assembly (green in Figure u\ and an outer glue on 
the west end of the north side of the assembly (blue in Figure [7]) . The null glue 
also has the sequence of bumps and dents (encoding 0), but lacking the outer 
glue. Repeating this process three more times yields sets of east, west, and south 
macroglue assemblies. 
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Fig. T. Converting a tile in a system with 7 glues to a macrotile with 0(log jG|) scale 
and 3 glues. The gray label of the tile is used as a label for all tiles in the core and 
macroglue assemblies, with the 1 and markings for illustration of the glue bit encod- 
ing. 
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For each label / G S{T), repeat the process of producing the macroglue 
assemblies once using a tile set exclusively labeled I. Also produce a square 
0(log|G|) X ©(log I G I) core assembly, with a single copy of the inner glue on 
the counterclockwise end of each face. Use the macroglue and core assemblies 
to produce a set of macrotiles, one for each i G T, consisting of a core assembly 
whose tiles have the label of t, and four glue assemblies encode the four glues of 
t and whose tiles have the label of t. Extend the mix graph M' of S' by carrying 
out the mixings of M but starting with the equivalent macrotiles. Define the 
simulation function / to map each macrotile to the label found on the macrotile, 
and the function g to take the portion of M' and g to be the portion of the mix 
graph carrying out the mixings of S. 

The work done to produce the glue assemblies is 0{S{T)\G\), to produce the 
core assemblies is 0(Z'(T) log log |G|), and to produce the macrotiles is 0(|T|). 
Carrying out the mixings of S requires 0(|5|) work. Since each macrotile is 
used in at least one mixing simulating a mixing in S, \T\ < \S\. Additionally, 
\G\ < 4|r|. So the total system size is 0(i:(r)|G|-hi:(r) log log|G|-H|T|-^|5|) = 
0{U{T)\T\ + \S\). 

Armed with these tools, we are ready to convert PCFGs into SSASs. Recall 
that in Section |4] we showed that in the worst case, converting a PCFG into a 
SSAS (or SAS) must incur an J7(logn/loglogn)-factor increase in system size. 
Here we achieve a 0(logn)-factor increase. 

Theorem 2. For any polyomino P with \P\ — n derived by a PCFG G, there 
exists a SSAS S with \S\ = 0(|G| logn) producing an assembly with label poly- 
omino P' , where P' is a {O {log n),0{n))- fuzzy replica of P. 

Proof. We combine the macrotile construction of LemmalGl the generalized coun- 
ters of LemmalSJ and a macrotile assembly invariant that together enable efficient 
simulation of each production rule in a PCFG by a set of O(logn) mixing steps. 

Macrotiles. The macrotiles used are extended versions of the macrotiles 
in Lemma [6] with two modifications: a secondary, resevoir macroglue assembly 
on each side of the tile in addition to a primary bonding macroglue, and a thin 
cage of dimensions 0{n) x 0(logn) surrounding each resevoir macroglue (see 
Figure [8|. 

Mixing a macrotile with a set of bins containing counter row assemblies con- 
structed by Lemma [5] causes completed (and incomplete) counter rows to attach 
to the macrotile's macroglues. Because each macroglue's geometry matches the 
geometry of exactly one counter row, a partially completed counter row that 
attaches can only be completed with bit assemblies that match the macroglue's 
value. As a result, mixing the bin sets of Lemma [S] with an assembly consisting 
of macrotiles produces the same set of products as mixing a completed set of 
binary counter rows with the assembly. 

An attached counter row effectively causes the macroglue's value to change, 
as it presents geometry encoding a new value and covers the macroglue's previous 
value. The cage is constructed to have height sufficient to accomodate up to n 
counter rows attached to the reservoir macroglue, but no more. 
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Fig. 8. A macrotile used in converting a PCFG to a SAS, and examples of value 
maintenance and offset preparation. 



Because of the cage, no two macrotiles can attach by their bonding macroglues 
unless the macroglue has more than n counter rows attached. Alternatively, one 
can produce a thickened counter row with thickness sufficient to extend beyond 
the cage. We call such an assembly a macroglue activator^ as it "activates" a 
bonding macroglue to being able to attach to another promoted macroglue on 
another macrotile. Notice that a macroglue activator will never attach to a bond- 
ing macroglue's resevoir twin, as the cage is too small to contain the activator. 

An invariant. Counter rows and activators allow precise control of two 
properties of a macrotile: the identities of the macroglues on each side, and 
whether these glues are activated. In a large assembly containing many macroglues, 
the ability to change and activate glues allows precise encoding of how an as- 
sembly can attach to others. In the remainder of the construction we maintain 
the invariant that every macrotile has the same glue identity on all four sides, 
and any macrotile assembly consists of macrotiles with glue identities forming a 
contiguous interval, e.g. 4, 5, 6, 7. Intervals are denoted [i,i'], e.g. [34,57]- 

By LemmalS] a set of row counters incrementing the glue identities of all glues 
on a macrotile can be produced using O(logn) work. Activators, by virtue of 
being nearly rectangular with 0(log n) cells of bit geometry can also be produced 
using O(logn) work. 
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Production rule simulation. Consider a PCFG with non-terminal N 
and production rule N — > {Ri, (xi, yi)){R2, {x2, J/2)) £^nd a SSAS with two bins 
containing assemblies Ai, A2 with the label polyominocs of Ai and A2 being 
fuzzy replicas of the polyominoes derived by i?i and i?2- Also assume Ai and A2 
are assembled from the macrotiles just described, including the invariant that 
the identities of the glues on Ai and A2 are identical on all sides of a macrotile 
and contiguous across the assembly, i.e. the identities of the glues are [ii, ji] and 
[*2i.?2] on assemblies Ai and ^27 respectively. 

Select two cells c^^^ , c^^, in the polyominoes derived by i?i and R2 adjacent in 
polyomino derived by N . Define the glue identities of the two macrotiles forming 
the supercells mapped to cr-^ and cji^ to be gi and 172 • Then the glue sets on Ai 
and A2 can be decomposed into three subsets [ii^gi — 1], [gi], [gi + I7J1] and 
[^2:32 ~ 1]; [32] J [52 + Ij J2]i respectively. We change these glue values in three 
steps: 

1. Construct two sets of row counters that increment ii through gi by ji — ii + l 
and 12 through 172 by 52^^2 + 1; and mix them in separate bins with Ai and A2 
to produce two new assemblies A\ and A'2. Assemblies A\ and A'2 have glues 
[gi -\- 1, gi + ji — ii -I- 1] and [32, 32 + J2 — ^2], respectively, and the macroglues 
with values gi and g2 now have values g'l — .91 + (.91 — ii)+.7i + l and 93 = 92, 
i.e. the glues of A'^ and A'^ are [g[ - (ji - ii),g'i] and [53,52 + J2 - «2]- 

2. Construct a set of row counters that increment the values of all glues on 
A'2 by 52 ^ 51 + 1 if this value is positive, and mix the counters with A^ to 
produce Ag. Then the macroglue with value 52 now has value 52 =5^-1-1 
and the glue values of A'^ and A'2 are [g'l — (ji — Ji), 5i] and [92 1 52 + J2 — «2]- 

3. Construct a pair of macroglue activators with values g'l and 52 that attach 
to the pair of macroglue sides matching the two adjacent sides of cells cr-^ 
and C/J2- I'^'fix each activator with the corresponding assembly A'^ or Aj. 

Mixing A'-y and A'2 with the pair of activated macroglues causes them to 
bond in exactly one way to form a superassembly A-^ whose label polyomino is 
a fuzzy replica of the polyomino derived by N . Moreover, the glue values of the 
macrotiles in A3 are [51 — (ji — *i), 52+^2— ^2], maintaining the invariant. Because 
each macrotile has a resevoir macroglue on each side, any bonding macroglue 
with an activator already attached has a resevoir macroglue that acceptsthe 
matching row counter, so each mixing has a single product and specifically no 
row counter products. 

System scale The PCFG P contains at most n production rules. Also, 
each step shifts glue identities by at most n (the number of distinct glues on the 
macrotile), so the largest glue identity on the final macrotile assembly is n^ . So 
we produce macrotiles with core assembhes of size O(logn) x O(logri) and cages 
of size 0{n). Assembling the core assemblies, cages, and initial macroglue assem- 
bhes of the macrotiles takes 0{\P\ logn-|-logn-|-logn) ~ 0{\P\ logn) work, dom- 
inated by the core assembly production. Simulating each production rule of the 
grammar takes O(logn) work spread across a constant number of 0(logn)-sized 
sequences of mixings to produce sets of row counters and macroglue activators. 
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Applying Lemina p^ to the construction (creating macrotiles of macrotiles) 
gives a constant-glue version of the previous theorem: 

Theorem 3. For any polyomino P with \P\ = n derived by a PCFG G, there 
exists a SSAS S' using 0(1) glues with \S'\ = 0(|G| logrt) producing an assembly 
with label polyomino P' , where P' is a (O (log nlog\ogn),0{nlog\ogn)) -fuzzy 
replica of P. 

Proof. The construction of Theorem[2]uses 0(log n) glues, namely for the counter 
row subconstruction of Lemma [5) With the exception of the core assemblies, all 
tiles of S have a common fuzz (gray) label, so creating macrotile versions of these 
tiles and carrying out all mixings involving these macrotiles and completed core 
assemblies is possible with 0(1-|T| + |5|) — 0{\S\) mixings and scale O(loglogn). 
Scaled core assemblies of size ©(n log log n) x ©(n log log n) can be constructed 
using constant glues and 0(log(T7,loglogn)) = O(logn) mixings, the same num- 
ber of mixings as the unsealed 0{n) x 0{n) core assemblies of Theorem [2J So 
in total, this modified construction has system size OdS*!) — 0(|G|logn) and 
scale O (log log n). Thus it produces an assembly with label polyomino that is a 
(O (log n log log n),0(n log log n))-fuzzy rephca of P. 

The results in this section and Section|4]achieve a "one-sided" correspondence 
between the smallest PCFG and SSAS encoding a polyomino, i.e. the smallest 
PCFG is approximately an upper bound for the smallest SSAS (or SAS). Since 
the separation upper bound proof (Theorem ^ is constructive, the bound also 
yields an algorithm for converting a a PCFG into a SSAS. 



6 PCFG over SAS and SSAS Separation Lower Bound 

Here we develop a sequence of PCFGs over SAS and SSAS separation results, all 
within a polylogarithmic factor of optimal. As utilized in Theorem [Tj the results 
in this section also hold for polynomially scaled versions of the polyominoes, 
showing that the "fuzziness" permitted in Section [5] was not unfair. 

6.1 General shapes 

In this section we describe an efficient system for assembling a set of shapes we 
call weak counters. An example of a rows in the original counter and macrotile 
weak counter are shown in Figure [T0| These shapes are macrotile versions of the 
doubly-exponential counters found in [7] with three modifications: 

1. Each row is a single path of tiles, and any path through an entire row 
uniquely identifies the row. 

2. Adjacent rows do not have adjacent pairs of tiles, i.e. they do not touch. 

3. Consecutive rows attach at alternating (east, west, east, etc.) ends. 
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Fig. 9. Two-bit examples of the weak (left), end-to-end (upper right), and block (lower 
right) binary counters used to achieve separation of PCFGs over SASs and SSASs in 
Section |6l 



Figure[TT]shows three consecutive counter rows attached in the final assembly. 
Each row of the doubly-exponential counter consists of small, constant-sized 
assemblies corresponding to or 1 values, along with a or 1 carry bit. We 
implement each assembly as a unique path of tiles and assemble the counter as 
in (7j , but using these path-based assemblies in place of the original assemblies. 
We also modify the glue attachments to alternate on east and west ends of each 
row. Because the rows alternate between incrementing a bit string, and simply 
encoding it, alternating the attachment end is trivial. Finally, note that adjacent 
rows only touch at their attachment, but the geometry encoded into the row's 
path prevents non-consecutive rows from attaching. 

Lemma 7. There exists a t = 1 SAS of size 0{b) that produces a 2^ -bit weak 
counter. 

Proof. The counter is an 0(l)-scaled version of the counter of Demaine et al [7]. 
They show that such an assembly is producible by a system of size 0{b). 

Lemma 8. For any PCFG G deriving a 2^^ -bit weak counter, \G\ = J?(2^ ). 

Proof. Define a minimal row spanner of row TZi to be a non-terminal symbol A'' 
of G with production rule A'' — >• (i3, (cci, yi))(C, (x2, 2/2)) such that the polyomino 
derived by A'' contains a path between a pair of easternmost and westernmost 
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Fig. 10. Zoomed views of increment (top) and copy (bottom) counter rows described 
in [7| and the equivalent rows of a weak counter. 



tiles of the row and the polyominoes derived by B and C do not. We claim that 
each row (trivially) has at least one minimal row spanner and each non-terminal 
of G is a minimal row spanner of at most one unique row. 

First, suppose by contradiction that a non-terminal iV is a minimal row 
spanner for two distinct rows. Because N is connected and two non-adjacent 
rows are only connected to each other via an intermediate row, N must be a 
minimal row spanner for two adjacent rows TZt and TZi+i. Then the polyominoes 
of B and C each contain tiles in both TZi and TZi+i, as otherwise either C or B 
is a minimal row spanner for TZi or TZt+i- 

Without loss of generality, assume B contains a tile at the end of TZi not 
adjacent to TZi+i. But B also contains a tile in TZi+i and (by definition) is 
connected. So B contains a path between the east and west ends of row TZi, and 
thus A'^ is a not a minimal row spanner for r^. So A'^ is a minimal row spanner 
for at most one row. 
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Fig. 11. Adjacent attached rows of the counter described in [^ (top) and the equivalent 
rows in the weak counter (bottom). 



Next, note that the necessarily-serpentine path between a pair of easternmost 
and westernmost tiles of a row in a minimal row spanner uniquely encodes the 
row it spans. So the row spanned by a minimal row spanner is unique. 

Because each non-terminal of G is a minimal row spanner for at most one 
unique row, G must have at least 2^ non-terminal symbols and total size i7(2^ ). 

Theorem 4. The separation of PC'FGs over t — \ SASs for single-label poly- 
omines is i7{n/ (loglogn)^). 

Proof. By the previous two lemmas, there exists a SAS of size 0{b) producing 
a 6-bit weak counter, and any PCFG deriving this shape has size /2(2^ ). The 
assembly itself has size n = 0{2^ b), as it consists of 2^ rows, each with b sub- 
assemblies of constant size. So the separation is Q{{n/b)/h) = i7(n/(loglogn)^). 

In [7j, the 0(loglogn)-sized SAS constructing a log n- bit binary counter re- 
peatedly doubles the length of each row (i.e. number of bits in the counter) 
using 0(1) mixings per doubling. Achieving such a technique in a SSAS seems 
impossible, but a simpler construction producing a &-bit counter with 0{b) work 
can be done by using a unique set of 0(1) glues for each bit of the counter. In 
this case, mixing these reusable elements along with a previously-constructed 
pair of first and last counter rows creates a single mixing assembling the entire 
counter at once. Modifying the proof of Theorem|4]to use this construction gives 
a similar separation for SSASs: 

Corollary 1. The separation of PCFGs over t — 1 SSASs for single-label poly- 
ominoes is Q{n/{\ogn)'^). 
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6.2 Rectangles 

For the weak counter construction, the lower bound in LemmalHldepended on the 
poor connectivity of the weak counter polyomino. This dependancy suggests that 
such strong separation ratios may only be achievable for special classes of "weakly 
connected" or "serpentine" shapes. Restricting the set of shapes to rectangles or 
squares while keeping an alphabet size of 1 gives separation of at most O(logn), 
as any rectangle of area n can be derived by a PCFG of size O(logn). But 
what about rectangles with a constant-sized alphabet? In this section we achieve 
surprisingly strong separation of PCFGs over SASs and SSASs for rectangular 
constant-label polyominoes, nearly matching the separation achieved for single- 
label general polyominoes. 

The construction The polyominoes constructed resemble binary counters whose 
rows have been arranged in sequence horizontally, and we call them b-bit end- 
to-end counters. Each row of the counter is assembled from tall, thin macrotiles 
(called bars), each containing a color strip of orange, purple, or green. The color 
strip is coated on its east and west faces with gray geometry tiles that encode 
the bar's location within the counter. 




Fig. 12. The rectangular polyomino used to show separation of PCFGs over SASs 
when constrained to constant-label rectangular polyominoes. The green and purple 
color strips denote and 1 bits in the counter. 



Each row of the counter has a sequence of green and purple display bars 
encoding a binary representation of the row's value and flanked by orange reset 



bars (see Figure 13). An example for & = 2 bits can be seen in Figure 12 

Each bar has dimensions 0(1) x 3(log2 6-1-6 + 2), sufficient for encoding two 
pieces of information specifying the location of the bar within the assembly. The 
row bits specify which row the bar lies in (e.g. the 7th row). The subrow bits 
specify where within the row the bar lies (e.g. the 4th bit). The subrow value 
starts at on the east side of a reset bar, and increments through the display 
bars until reaching 6 -I- 1 on the west end of the next reset bar. Bars of all three 
types with row bits ranging from to 2** — 1 are produced. 

Efficient assembly The counter is constructed using a SAS of size 0(b) in two 
phases. First, sequences of 0{b) mixings are used to construct five families of 
bars: 1. reset bars, 2. 0-bit display bars resulting from a carry, 3. 0-bit display 
bars without a carry, 4. 1-bit display bars resulting from a carry, 5. 1-bit display 
bars without a carry, The mixings product five bins, each containing all of the 
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Fig. 13. The implementation of the vertical bars in row 2 (Olt) of an end-to-end 
counter. 



bars in the family. These five bins are then combined into a final bin where the 
bars attach to form the 0(2^) x 0{b) rectangular assembly. The five families are 
seen in Figure [14] 

Efficient 0{b) assembly is achieved by careful use of the known approach of 
non-deterministic assembly of single-bit assemblies as done in [J. Assemblies 
encoding possible input bit and carry bit value combinations for each row bit 
and subrow bit are constructed and mixed together, and the resulting products 
are every valid set of input and output bit strings, i.e. every row of a binary 
counter assembly. 

As a warmup, consider the assembly of all reset bars. For these bars, the west 
subrow bits encode b and the east subrow bits encode 0. The row bits encode a 
value i on the west side, and i -|- 1 on the east side, for all i between and 2'' — 1. 
Constructing all such bars using 0{b) work is straightforward. For each of the 
log2 6-1-1 subrow bits, create an assembly where the west and east bits are 1 and 
respectively, except for the most significant bit (bit logj 6-1-1), where the west 
and east bits are both 0. 

For the row bits we use the same technique as in [7] and extended in LemmaH 
create a constant-sized set of assemblies for each bit that encode input and output 
value and carry bits. For bits 1 through 6—1 (zero-indexed) create four assemblies 
corresponding to the four combinations of value and carry bits, for bit create 
two assemblies corresponding to value bits (the carry bit is always 1), for bit 
6 create three assemblies corresponding to all combinations except both value 
and carry bits valued 1, and for bit 6 + 1 create a single assembly with both 
bits valued 0. Give each bit assembly a unique south and north glue encoding its 
location within the bar and carry bit value, and give all bit assemblies a common 
orange color strip. Mixing these assemblies produces all reset bars, with subrow 
west and east values of 6 and 0, and row values i and i -I- 1 for all i from to 
2''-l. 
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Fig. 14. The decomposition of bars used assemble a 6-bit end-to-end counter. 



In contrast to producing reset bars, producing display bars is more difficult. 
The challenge is achieving the correct color strip relative to the subrow and row 
values. Recall that the row value i locates the bar's row and the subrow value 
j locates the bar within this row. So the correct color strip for a bar is green if 
the jth bit of i is 0, and purple if the jth bit of i is 1. 

We produce four families of display bars, two for each value of the jth bit 
of of i. Each subfamily is produced by mixing a subrow assembly encoding j 
on both east and west ends with three component assemblies of the row value: 
the least significant bits (LSB) assembly encoding bits 1 through j — I oi i, the 
most significant bits (MSB) assembly encoding bits j + 1 through 6 of i, and the 
constant-sized jth bit assembly. This decomposition is seen in the bottom half 
of Figure [M] 

The four families correspond to the four input and carry bit values of the jth 
bit. These values determine what collections of subassemblies should appear in 
the other two components of the row value. For instance, if the input and carry 
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bit values are both 1, then the LSB assembly must have all I's on its west side 
(to set the jth carry bit to 1) and all O's on its east side. Similarly, the MSB 
assembly must have some value p encoded on its west side and the value p + 1 
encoded on its east side, since the jth bit and and jth carry bit were both 1, so 
the (j + l)st carry bit is also 1. 

Notice that each of the four families has b subfamilies, one for each value 
of j. Producing all subfamilies of each family is possible in 0{b) work by first 
recursively producing a set of b bins containing successively larger sets of MSB 
and LSB assemblies for the family. Then each subfamily can be produced using 
0(1) amortized work, mixing one of b sets of LSB assembly subfamilies, one of 
b sets of MSB assemblies, and the jth bit assembly together. For instance, one 
can produce the set of b sets of MSB assemblies encoding pairs of values p and 
p-l- 1 on bits b—1 through b, b — 2 through b, etc. by producing the set on bits k 
through b, then adding four assemblies to this bin (those encoding possible pairs 
of inputs to the {k — l)st bit) to produce a similar set on bits fc — 1 through b. 

Lemma 9. There exists a t = 1 SAS of size 0(b) that produces a b-bit end-to- 
end counter. 

Proof. This follows from the description of the system. The five families of bars 
can each be produced with 0(5) work and the bars can be combined together in 
a single mixing to produce the counter. So the system has total size 0{b). 

Lemma 10. For any PCFG G deriving a b-bit end-to-end counter, \G\ = fi{2^). 



Proof. Let G be a PCFG deriving a 6-bit end-to-end counter. Define a mini- 
mal row spanner to be a non-terminal symbol N with production rule N — ^ 
(i?, (a;i,j/i))(C, (0:2,2/2)) such that the polyomino derived by N (denoted piq) 
horizontally spans the color strips of all bars in row TZi including the reset bar 
at the end of the row, while the polyominoes derived by B and C (denoted 
Pb and pc) do not. Consider the bounding box D of these color strips (see 



Figure 15) 
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Fig. 15. A schematic of the proof that a non-terminal is a minimal row spanner for 
at most one unique row. (Left) Since pB and pc can only touch in D, their union 
non-terminal A^ must be a minimal row spanner for the row in D. (Right) The row's 
color strip sequence uniquely determines the row spanned by N (01b). 
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Without loss of generality, pB intersects the west boundary of D but does 
not reach the east boundary, while pc intersects the east boundary but does not 
reach the west boundary, so any location at which pg and pc touch must lie in 
D. Then any row spanned by pN and not spanned by ps or pc must lie in Z?, 
since spanning it requires cells from both pB and pc- So pn is a minimal row 
spanner for at most one row: row TZi. 

Because the sequence of green and purple display bars found in D is distinct 
and separated by display bars in other rows by orange reset bars, each minimal 
row spanner spans a unique row TZi ■ Then since each non-terminal is a spanner for 
at most one unique row, G must have 2^ non-terminal symbols and \G\ ~ fl{2^). 

Theorem 5. The separation of PCFGs overr — 1 SASs for constant-label rect- 
angles is Q{n/(\ogn)^). 

Proof. By construction, a &-bit end-to-end counter has dimensions 0(2^6) x 0{b). 
So n — 0{2^b^) and b = 0{\ogn). Then by the previous two lemmas, the sepa- 
ration is Q{{n/b'^)/b) ^ Q{n/{\ogn)^). 

We also note that a simple replacement of orange, green, and purple color 
strips with distinct horizontal sequences of black/white color substrips yields the 
same result but using fewer distinct labels. 

6.3 Squares 

The rectangular polyomino of the last section has exponential aspect ratio, sug- 
gesting that this shape requires a large PCFG because it approximates a pat- 
terned one- dimensional assemblies reminiscent of those in [5] . Creating a poly- 
omino with better aspect ratio but significant separation is possible by extending 
the polyomino's labels vertically. For a square this approach gives a separation of 
PCFGs over SASs of n{y/n/ logn), non-trivial but far worse than the rectangle. 

The construction In this section we describe a polyomino that is square but 
contains an exponential number of distinct subpolyominoes such that each sub- 
polyomino has a distinct "minimal spanner" , using the language of the proof of 



Lemma 10 These subpolyominoes use circular versions of the vertical bars of 
the construction in Section [6?2] arranged concentrically rather than adjacently. 
We call the polyomino a b-bit block counter, and an example for & = 2 is seen in 
Figure [I6j 

Each block of the counter is a 0{b^) x 0{b'^) square subpolyomino encoding 
a sequence of b bits via a sequence of concentric rectangular rings of increasing 
size. Each ring has a color loop encoding the value of a bit, or the start or end 
of the bit sequence (the interior or exterior of the block, respectively). The color 
loop actually has three subloops, with the center loop's color (green, purple, light 



blue, or dark blue in Fig. 16 1 indicating the bit value or sequence information. 



and two surrounding loops (light or dark orange in Fig. 16 1 indicating the interior 
and exterior sides of the loop. 
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D 



D 



Fig. 16. The square polyomino used to siiow separation of PCFGs over SASs when 
constrained to constant-label square polyominoes. The green and purple color subloops 
denote and 1 bits in the counter, while the light and dark blue color subloops denote 
the start and end of the bit string. The light and dark orange color subloops indicate 
the interior and exterior of the other subloops. 



Efficient assembly of blocks Though each counter block is square, they are con- 
structed similarly to the end-to-end counter rows of Section |6.2| by assembling 
the vertical bars of each ring together into horizontal stacks of assemblies. Hor- 
izontal slabs are added to "fill in" the remaining portions of each block. 

The bars are identical to those found in Section [6^2] with three modifications 
(seen in Figure [T7|). First, each bar has additional height according to the value 
of the subrow bits (8 tiles for every increment of the bits). Second, each bar has 
four additional layers of tiles on the side (east or west) facing the interior of the 
block, with color bits at the north and south ends of the side encoding three 
values: 11;, (if the center color subloop is purple, a 1-bit), OOf, (if the center color 
subloop is green, a 0-bit), or 01b (if the center color subloop is dark blue, the 
end of the bit sequence) . The additional layers are used to fill in gaps between 
adjacent rings left by protruding geometry, and the bit values are used to control 
the attachment of the horizontal slabs of each ring. 

Third, the reset bars used in Sect ion [6. 2 [ are replaced with two kinds of bars: 
start bars and end bars, seein in Figure |19[ End bars form the outermost rings 
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Fig. 17. The implementation of rings in each block of the block counter. 



of each block, and the start bars form the square cores of each block. Both start 
and end bars "reset" the subrow counters, and the east end bars increment the 
row value. 

Recall that the vertical bars of the end-to-end counter in Section 16.21 were 
constructed using 0{b) total work by amortizing the constructing subfamilies of 
MSB and LSB assemblies for each subrow value j. We use the same trick here 
for these assemblies as well as the new assemblies on the north and south ends 
of each bar containing the color bits. In total there are twelve families of vertical 
bar assemblies (four families of west display bars, four families of east display 
bars, and two families each of start and end bars), and each is assembled using 
0(6) work. 

Finally, the horizontal slabs of each ring are constructed as six families, each 



using 0{b) work, as seen in Figure 20 



Efficient assembly of the counter Once the families of vertical bars and horizontal 
slabs are assembled into blocks, we are ready to arrange them into a completed 
counter. Each row of the counter has v2^ = 2^"/^ blocks. So assuming b is even, 
the 6/2 least significant bits of the westmost block of each row are O's, and of 
the eastmost block are I's. Before mixing the vertical bar families together, we 
"cap" the east end bar of each block at the east end of a row by constructing a 
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Fig. 18. The decomposition of vertical display bars used to assemble blocks in the 6-bit 
block counter. Only the west bars are shown, with east bars identical but color bits 
and color loops reflected. 



set of thin assemblies (right part of Figure 21 1 and mixing them with the family 
of east end bars. 

After this modification to the east end bar family, mixing all vertical bar 
families results in 2''/^ assemblies, each forming most of a rovi^ of the block 
counter. Mixing these assemblies with the families of horizontal slabs results in a 
completed set of block counter rows, each containing 2''/^ square assemblies with 
dimensions 6>(6^) x 0{b^), forming 2^/^ rectangles with dimensions 0(2^/^6^) x 

To arrange the rows vertically into a complete block counter, a vertically- 



oriented version of the end-to-end counter of Section 6.2 with geometry instead 



of color strips (left part of Fig. 21 1 is assembled and used as a "backbone" for 
the rows to attach into a combined assembly. This modified end-to-end counter 



(see Figure 22) has subrow values from to 6/2, for the 6/2 most signficant bits 
of the row value of each block, and row values from to 2^/^. Modified versions 
of reset bars with height (width in the horizontal end-to-end counter) 0(6^) are 
used to bridge across the geometry-less portions of the west sides of the blocks, 
as well as the always-zero 6/2 least significant bits of the block's row value and 
subrow log2 6 bits. 

This modified end-to-end counter can be assembled using 0{b) work as 
done for the original end-to-end counter, since the longer reset bars only add 
0(log(6^)) — 0(log6) work to the assembly process. After the vertical end-to- 
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Fig. 19. The decomposition of vertical start and end bars used to assemble blocks in 
the 6-bit block counter. 




end counter has been combined with the blocks to form a complete block counter, 
a horizontal end-to-end counter is attached to the top of the assembly to produce 
a square assembly. 

Lemma 11. For even h, there exists a t — \ SAS of size 0{b) that produces a 
b-bit block counter. 

Proof. The construction described builds families of vertical bars and horizontal 
slabs that are used to assemble each the rings forming all blocks in the counter. 
There are a constant number of families, and each family can be assembled 
using 0{b) work. The vertical and horizontal end-to-end counters can also be 
assembled using 0(6) work each by Lemma l9J Then the 6-bit block counter can 
be assembled by a SAS os size 0{b). 

We now consider a lower bound for any PCFG G deriving the counter, using 
a similar approach as Lemma [TOl 

Lemma 12. For any PCFG G deriving a b-bit block counter, \G\ = ^2(2''). 

Proof. Define a minimal block spanner as to be a non-terminal symbol N in 
G with production rule N — )• {B, {xi,yi)){G, {x2, 2/2)) such that the polyomino 
derived by N (denoted pn) contains a path from a gray cell outside the color 
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Fig. 20. The decomposition of horizontal slabs of each ring the 6-bit block counter. 



loop of the end ring of the counter to a gray cell inside the start color loop of 
the counter, and the polyominoes derived by B and C (denoted pb and pc) do 
not. 

First we show that any minimal block spanner is a spanner for at most one 
block. Assume by contradiction and that A'^ is a minimal block spanner for two 
blocks Bi and Bj and that ps contains a gray cell inside the start color loop of 
Bi. Then B must be entirely contained in the color loop of the end ring of Bi, as 
otherwise A^ is not a minimal block spanner for Bi. Similarly, C must then be 
entirely contained in the color loop of the end ring of Bj . Since no pair of color 
loops from distinct blocks have adjacent cells, pn is not a connected polyomino 
and so G is not a valid PCFG. 

Next we show that the block spanned by A^ is unique, i.e. A^ cannot be reused 
as a minimal spanner for multiple blocks. See Figure |23| Let A^ be a minimal 
spanner for a block Bi and p be a path of cells in p^ starting at a gray cell 
contained in the start ring of Bi and ending at a gray cell outside the end ring 
of Bi . Consider a traversal of p, maintaining a stack containing the color loops 
crossed during the traversal. Crossing a color loop from interior to exterior (a 
sequence of dark orange, then green, purple, or blue, then light orange cells) adds 
the center subloop's color to the stack, and traversing from exterior to interior 
removes the topmost element of the stack. 

We claim that the sequence of subloop colors found in the stack after travers- 
ing an end ring from interior to exterior encodes a unique sequence of display 
rings and thus a unique block. To see why, first consider that the color loop of 
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Fig. 21. (Left) The interaction of a vertical end-to-end counter witli the westernmost 
block in each row. (Right) The cap assemblies built to attach to the easternmost block 
in each row. 





every ring forms a simple closed curve. Then the Jordan curve theorem implies 
that entering or leaving each region of gray cells between adjacent color loops 
requires traversing the color loop. Then by induction on the steps of p, the stack 
contains the set of rings not containing the current location on p in innermost 
to outermost order. So the stack state after exiting the exterior of the end ring 
uniquely identifies the block containing p and A^ is a minimal spanner for this 
unique block. 

Since there are 2'' distinct blocks in a 6-bit block counter, any PCFG that 
generates a counter has at least 2'' non-terminal symbols and size Qi^). 

Theorem 6. The separation of PCFGs over t — \ SASs for constant-label 
squares is f2 {n / (log n)^). 

Proof. By construction, a 6-bit block counter has size 0{2^lP) — n and so 
6 — 0{\ogn). By the previous two lemmas, the separation is Q{{n/lP')/b) = 
f2{n/{lognf). 

Unlike the previous rectangle construction, it does not immediately follow 
that a similar separation holds for 2-label squares. Finding a construction that 
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Fig. 22. The decomposition of the bars of a vertically-oriented end-to-end counter used 
to combine rows of blocks in a block counter. 



achieves nearly-linear separation but only uses two labels remains an open prob- 
lem. 



6.4 Constant-glue constructions 

Lemma [6] proved that any system S can be converted to a slightly larger system 
(both in system size and scale) that simulates S. Applying this lemma to the 
constructions of Section [6] yields identical results for constant-glue systems: 

Theorem 7. All results in Sectiorum hold for systems with 0(1) glues. 

Proof. Lemma [6] describes how to convert any SAS or SSAS S = (T, G, r, M) 
into a macrotile version of the system S' that uses a constant number of glues, 
has system size 0{S{T)\T\ + \S\), and scale factor 0(log \G\). Additionally, the 
construction achieves matching labels on all tiles of each macrotile, including 
the glue assemblies. Because the labels are preserved, the polyominoes produced 
by each macrotile system S' simulating an assembly system S in Section [6] 
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[start, 0, 1, end] 




Fig. 23. A schematic of the proof that the block spanned by a minimal row spanner is 
unique. Maintaining a stack while traversing a path from the interior of the start ring 
to the exterior of the end ring uniquely determines the block spanned by any minimal 
block spanner containing the path. 



preserves the lower bounds for PCFGs (Lemmas [S] flo] and 12) of each con- 
struction. Moreover, the number of labels in the polyomino is constant and so 
\S'\ = 0(|T| + |5|) — 0{\S\) and the system size of each construction remains the 
same. Finally, the scale of the macrotiles is ©(loglGj) = 0(log |5| = 0(log&), 
so n is increased by a 0((log6)^)-factor, but since n was already exponential 
in b, it is still the case that b = 0{logn) and so the separation factors remain 
unchanged. 



7 Conclusion 



As the results of this work show, efficient staged assembly systems may use a 
number of techniques including, but not limited to, those described by local com- 
bination of subassemblies as captured by PCFGs. It remains an open problem 
to understand how the efficient assembly techniques of Section [5] and Section l6] 
relate to the general problem of optimally assembling arbitrary shapes. 
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